Word sketch lexicography: new perspectives on lexicographic studies of Chinese near synonyms

Authors

  • Shan Wang
  • Chu-Ren Huang
Abstract

The comparative study of near synonyms is one of the most productive research paradigms in Chinese lexicography. Empirical studies that discriminate near synonyms are either introspection-based or corpus-based. Yet, because of the large quantity of data in a corpus, lexicological studies of Chinese rarely make full use of corpus data. To solve this problem, Kilgarriff’s Word Sketch Engine was designed to automatically extract the grammatical and collocational relations of target words from corpora so that researchers can analyze them further. Chinese Word Sketch (CWS), a language-specific version of the Word Sketch Engine, provides a tool for automatically identifying grammatical information in Gigaword-size corpora. Through a comparative study of the synonymous emotion words 愉快 yúkuài 'pleasant' and 高興 gāoxìng 'happy', this paper illustrates how CWS can distinguish them and help lexicographers discriminate their subtle differences. In particular, it focuses on the contexts where these synonymous words can be used to define each other and the contexts where they should be differentiated. It also discusses how to select information from CWS so that the information presented is suitable for lexicographic studies. Through the study of near synonyms, we propose that Word Sketch Lexicography will lead the next generation of dictionaries.

Similar resources

The Sketch Engine

Word sketches are one-page automatic, corpus-based summaries of a word’s grammatical and collocational behaviour. They were first used in the production of the Macmillan English Dictionary and were presented at Euralex 2002. At that point, they only existed for English. Now, we have developed the Sketch Engine, a corpus tool which takes as input a corpus of any language and a corresponding gram...

Polish Word Sketches

Word sketches are one-page automatic, corpus-based summaries of a word's grammatical and collocational behaviour. They were first used in the production of the Macmillan English Dictionary (Rundell 2002). At that point, word sketches only existed for English. Today, the Sketch Engine is available, a corpus tool which takes as input a corpus of any language and corresponding grammar patterns and...

Slovene Word Sketches

Word sketches are one-page automatic, corpus-based summaries of a word's grammatical and collocational behaviour. They were first used in the production of the Macmillan English Dictionary (Rundell 2002). At that point, they only existed for English. Today, the Sketch Engine is available, a corpus tool which takes as input a corpus of any language and corresponding grammar patterns and which ge...

Hindi Word Sketches

Word sketches are one-page automatic, corpus-based summaries of a word’s grammatical and collocational behaviour. These are widely used for studying a language and in lexicography. Sketch Engine is a leading corpus tool which takes as input a corpus and generates word sketches for the words of that language. It also generates a thesaurus and ‘sketch differences’, which specify similarities and ...

Building Russian Word Sketches as Models of Phrases

The paper describes the writing of a Sketch Grammar for the Russian language as part of the Sketch Engine system. The Sketch Engine is a corpus tool which takes as input a corpus of any language and corresponding grammar patterns. The system gives information about a word’s collocability in concrete dependency models, and generates lists of the most frequent phrases for a given...

Publication date: 2017